Vector library cleanup #473

solidpixel · 2024-06-07T20:41:10Z

The astcenc vector library effectively implements two different class APIs:

An explicit 4-wide API which is used via explicit 4-wide types (e.g. vfloat4) in the codec.
A vector-length agnostic API, which is used as N-wide types in the codec (e.g. vfloat) in the codec, and where the width is resolved at compile time.

For historical reasons the classes that are only used as a VLA classes (e.g. vfloat8 for AVX2) implement a lot of functionality which was inherited from the original 4-wide implementation and not actually used in the VLA parts of the codec. This makes adding new VLA implementation (e.g. Arm SVE) more expensive than it needs to be.

This PR doesn't add SVE support, but does some cleanup to minimize the vector library API as a precursor to doing so. The main changes are:

Remove VLA indexable lane read functions, as with true VLA code the lane count isn't known.
Replace VLA use of .lane<0>() with dedicated scalar function returns e.g. use hmax_s() rather than hmax.lane<0>(). This was beeing done in places before, but was not done consistently. Now this pattern is used everywhere.

bengaineyarm · 2024-07-17T14:15:46Z

Source/astcenc_pick_best_endpoint_format.cpp

@@ -1307,7 +1307,7 @@ unsigned int compute_ideal_endpoint_formats(
 vmask lanes_min_error = vbest_ep_error == hmin(vbest_ep_error);
 vbest_error_index = select(vint(0x7FFFFFFF), vbest_error_index, lanes_min_error);
 vbest_error_index = hmin(vbest_error_index);
- int best_error_index = vbest_error_index.lane<0>();
+ int best_error_index = vbest_error_index.lane0();


You squished out the other lane0 accesses into hmax_s etc... any reason not to wrap this one?

Nevermind you fixed it later

bengaineyarm · 2024-07-17T14:18:06Z

Source/UnitTest/test_simd.cpp

@@ -169,8 +203,8 @@ TEST(vfloat, ChangeSign)
 /** @brief Test VLA atan. */
 TEST(vfloat, Atan)
 {
- vfloat a(-0.15f, 0.0f, 0.9f, 2.1f);
- vfloat r = atan(a);
+ vfloa4 a(-0.15f, 0.0f, 0.9f, 2.1f);


Nevermind you fixed it later

bengaineyarm

LGTM

solidpixel added 2 commits June 7, 2024 13:24

Fix comment typos in AVX2 header

1f03fdf

Remove arbitrary lane access from VLA code

d608da5

solidpixel marked this pull request as draft June 7, 2024 20:41

solidpixel self-assigned this Jun 7, 2024

solidpixel added this to the 5.0.0 milestone Jun 7, 2024

Remove unused clampz

d34147f

solidpixel force-pushed the vla_cleanup branch from f33be82 to d34147f Compare June 7, 2024 20:43

Use scalar functions where needed

eb21c08

solidpixel modified the milestones: 5.0.0, 4.9.0 Jun 7, 2024

solidpixel added 3 commits July 1, 2024 12:49

Remove vfloat*::lane_id() functions

2758f64

Remove 8-wide literal loads

be619f8

Remove unused gatheri

f648880

solidpixel changed the title ~~VLA vector library cleanup~~ Vector library cleanup Jul 2, 2024

Fix unit test typo

08026e7

solidpixel requested a review from bengaineyarm July 2, 2024 08:01

solidpixel marked this pull request as ready for review July 2, 2024 08:01

Merge branch 'main' into vla_cleanup

508df78

solidpixel mentioned this pull request Jul 2, 2024

Add Arm SVE 8-wide (256b) implementation #480

Merged

Merge branch 'main' into vla_cleanup

576ab67

bengaineyarm reviewed Jul 17, 2024

View reviewed changes

bengaineyarm approved these changes Jul 17, 2024

View reviewed changes

solidpixel merged commit 69bc17b into main Jul 17, 2024
4 checks passed

solidpixel deleted the vla_cleanup branch July 17, 2024 14:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vector library cleanup #473

Vector library cleanup #473

solidpixel commented Jun 7, 2024 •

edited

Loading

bengaineyarm Jul 17, 2024

bengaineyarm Jul 17, 2024

bengaineyarm Jul 17, 2024

bengaineyarm Jul 17, 2024

bengaineyarm left a comment

Vector library cleanup #473

Vector library cleanup #473

Conversation

solidpixel commented Jun 7, 2024 • edited Loading

bengaineyarm Jul 17, 2024

Choose a reason for hiding this comment

bengaineyarm Jul 17, 2024

Choose a reason for hiding this comment

bengaineyarm Jul 17, 2024

Choose a reason for hiding this comment

bengaineyarm Jul 17, 2024

Choose a reason for hiding this comment

bengaineyarm left a comment

Choose a reason for hiding this comment

solidpixel commented Jun 7, 2024 •

edited

Loading